Improvement of a structured language model: arbori-context tree

Authors

  • Shinsuke Mori
  • Masafumi Nishimura
  • Nobuyasu Itoh
Abstract

In this paper we present an extension of a context tree for a structured language model (SLM), which we call an arbori-context tree. The state-of-the-art SLM predicts the next word from a fixed partial tree of the history tree, such as the two exposed heads. An arbori-context tree allows us to select an optimal partial tree of the history tree for next-word prediction depending on its effectiveness, in the same way that a context tree selects the length of the history (the n of an n-gram). The experiment we conducted showed that the test-set perplexity of the SLM based on an arbori-context tree (79.98) was lower than that of the SLM with a fixed history (101.56).
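The idea of a context tree choosing the history length per prediction can be illustrated with a minimal sketch. This is not the authors' method: it works on flat word histories rather than partial parse trees, and it uses a simple count threshold (the hypothetical parameter min_count) as a stand-in for the effectiveness criterion mentioned in the abstract. The function names train_counts, select_context, and next_word_prob are likewise assumptions made for illustration.

```python
from collections import Counter, defaultdict


def train_counts(words, max_order):
    """Count next-word occurrences for every history suffix of up to max_order words."""
    counts = defaultdict(Counter)
    for i, word in enumerate(words):
        for k in range(max_order + 1):
            if i - k < 0:
                break
            counts[tuple(words[i - k:i])][word] += 1
    return counts


def select_context(history, counts, min_count):
    """Return the longest suffix of history observed at least min_count times.

    Assumption: a count threshold replaces the paper's effectiveness criterion.
    """
    for k in range(len(history), -1, -1):
        ctx = tuple(history[len(history) - k:])
        seen = counts.get(ctx)
        if seen is not None and sum(seen.values()) >= min_count:
            return ctx
    return ()


def next_word_prob(history, word, counts, min_count):
    """Relative-frequency estimate of word given the selected context (no smoothing)."""
    ctx = select_context(history, counts, min_count)
    seen = counts.get(ctx, Counter())
    total = sum(seen.values())
    return seen[word] / total if total else 0.0


corpus = "a b a b a c".split()
counts = train_counts(corpus, max_order=2)
# With a strict threshold the two-word history is shortened to one word.
short_ctx = select_context(["b", "a"], counts, min_count=3)
# With a looser threshold the full two-word history is kept.
long_ctx = select_context(["b", "a"], counts, min_count=2)
```

Here the same history ["b", "a"] is used at different lengths depending on how much evidence supports it, which mirrors, in a much simpler setting, how a context tree varies n from one prediction to the next.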

Similar resources

On the Development of a Model of Discipline-specific Reading Strategies in the Context of Iranian EFL Learners

Abstract Reading strategies are seen as supportive means to help learners process and comprehend English texts effectively. The present research probed to posit a discipline-specific model of reading strategies for Iranian TEFL postgraduate students. The motive behind developing a local model of reading strategy is twofold: first, a variety of postgraduate students admitted for M.A and Ph.D. pr...

Studying impressive parameters on the performance of Persian probabilistic context free grammar parser

In linguistics, a tree bank is a parsed text corpus that annotates syntactic or semantic sentence structure. The exploitation of tree bank data has been important ever since the first large-scale tree bank, The Penn Treebank, was published. However, although originating in computational linguistics, the value of tree bank is becoming more widely appreciated in linguistics research as a whole. F...

Code-Copying in the Balochi Language of Sistan

This empirical study deals with language contact phenomena in Sistan. Code-copying is viewed as a strategy of linguistic behavior when a dominated language acquires new elements in lexicon, phonology, morphology, syntax, pragmatic organization, etc., which can be interpreted as copies of a dominating language. In this framework Persian is regarded as the model code which provides elements for b...

A Structured Language Model for Incremental Tree-to-String Translation

Tree-to-string systems have gained significant popularity thanks to their simplicity and efficiency by exploring the source syntax information, but they lack in the target syntax to guarantee the grammaticality of the output. Instead of using complex tree-to-tree models, we integrate a structured language model, a left-to-right shift-reduce parser in specific, into an incremental tree-to-string...

Publication date: 2001